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Abstract. We study the Kronccker sequence {na}n<N on the 
torus T"^ where a is uniformly distributed on T''. We show that 
the discrepancy of the number of visits of this sequence to a ran- 
dom box, normalized by In'' N converges as TV ^ oo to a Cauchy 
distribution. The key ingredient of the proof is a Poisson limit 
theorem for the Cartan action on the space of d + 1 dimensional 
lattices. 

1. Introduction 

1.1. Equidistribution of Kronecker sequences of T'^. It is known 
that the orbits of a non resonant translation on the torus T"^ = /l/ 
are uniformly distributed. A quantitative measure of uniform distribu- 
tion is given by the discrepancy function: given a set C C T'^ let 

N-\ 

Dc{x, a,N) = J2 Xc{x + ka) - Nyi{C) 

n=0 

where (x, a) e T*^ x T'', pL is the Haar measure on the torus and xc 

is the characteristic function of the set C. Uniform distribution of the 
sequence x + ka on T'^ is equivalent to the fact that, for regular sets C, 
Dc{x,a,N)/N ^ as iV — )■ oo. A step further in the description of 
the uniform distribution is the study of the rate of convergence to of 

Dc{x,a,N)/N. 

Already with = 1, it is clear that if a G T — Q is fixed, the dis- 
crepancy Dc{x,a, N) displays an oscillatory behavior according to the 
position of N with respect to the denominators of the best rational ap- 
proximations of a. A great deal of work in Diophantine approximation 
has been done on estimating the discrepancy function in relation with 
the arithmetic properties of a G T, and more generally for a G T'^. It 
is of common knowledge that in studying the discrepancies in dimen- 
sion 1 the continued fraction algorithm provides crucial help, and that 
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the absence of an analogue in higher dimensions makes the study of 
discrepancies much harder. 
In particular, let 



where the supremum is taken over all sets C in some natural class of 
sets B, for example balls or boxes. The case of (straight) boxes was 
extensively studied, and growth properties of the sequence D{a,N) 
were obtained with a special emphasis on their relations with the Dio- 
phantine approximation porperties of a. In particular, following earlier 
advances of [HI El HSl IHl HH] and others, pQ proves that for arbitrary 
positive increasing function 0(n) 



In dimension d = 1, this result is the content of Khinchine theorems 
obtained in the early 1920's [TT] , and it follows easily from well-known 
results from the metrical theory of continued fractions (see for example 
the introduction of [1]). The higher dimensional case is significantly 
more difficult and the cited bound was only obtained in the 1990s. 

The bound in ([T]) focuses on how bad can the discrepancy become 
along a subsequence of A^, for a fixed a in a full measure set. In a sense, 
it deals with the worst case scenario and do not capture the oscillations 
of the discrepancy. 

On the other hand, the restriction on a is necessary, since given 
any — > it is easy to see that for a G T sufficiently Liouville, 
the discrepancy (relative to intervals) can be as bad as A^„e„ along a 
suitable sequence Nn (large multiples of Liouville denominators) It is 
conjectured that for any a the discrepancy can be as bad as (InA^)*^ 
but not much is known better than the general lower bound (InA^)"'/^ 
that holds for every sequence on T'^ ([16J). Here again, due to the 
use of continued fractions the latter conjecture can be easily verified in 
dimension 1 (cf. discussion in pLj). 

In another direction, but still studying the discrepancy for a fixed a 
and along subsequences of N, [7j obtains a Central Limit Theorem in 
the one dimensional case of circle rotations. The results of [7] apply 
either for a set of a of zero measure (so called badly approximable 
numbers) and a set of times of large density, or for all a but for a small 
set of times (in both cases, the time sets depend on a). 



D{a,N) = supDc{0,a,N) 



(1) 




(In NY^iln In A^) almost every a G T'^. 



D{a, N) is bounded for 



n 
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By contrast, if one lets a and x be random then it is possible to obtain 
asymptotic distributions of the adequately normalized discrepancy for 
all N. 

This is the approach adopted by Kesten in |9l[T0] (see also f2]) where 
he studied the distribution of the discrepancies related to circular ro- 
tations as a and x are randomly distributed over the circle. He proved 
the following result. 

Theorem [91 do]. Let Dn{x, a) = J2n=o X[a,b]ix+ka)- N{b- a). There 
is a number p = p{b — a) such that if {x, a) is uniformly distributed on 
then converges to the standard Cauchy distribution, that is, 

Dn . . tan-^2; 1 

Moreover p{b — a) = pq is independent ofb — aifb — a^Q and it has 
non-trivial dependence on b — a if b — a ^ Q. 

Our goal is to extend this result to higher dimensions, and as in the 
case of other results related to discrepancies of Kronecker sequences, 
the main difficulty will come from the absence of a continued fraction 
algorithm that was also the main tool in Kesten's proof. 

Before we describe our approach, let us mention that there are two 
natural counterparts to intervals in higher dimension: balls and boxes. 
In [S] we considered the case where C is analytic and strictly convex and 
showed that Dc{x,a, N)/N^'^~^^^'^'^ has a limiting distribution (which 
however depends on C). 

Here we address the case where C is a box and show that Dc{x, a, N) / (h 
converges to a Cauchy distribution. To avoid the irregular behavior of 
the limiting distribution as the function of the considered box, as is 
the case in Kesten's result for example, we introduce an additional 
randomness to the parameters, by letting the lengths of the box's sides 
fluctuate. For a reason that will be explained in the sequel we have also 
to apply (arbitrarily small) random linear deformations on the boxes. 

More precisely, for u = (mi, . . . ,Ud) with < Ui < 1/2 for every i, 
we define a cube on the (i-torus by C„ = [— x . . . [—Ud,Ud]. Let 
T] > and MCu be the image of Cu by a matrix M G SL((i, M) such 
that 

M = (aij) & Gr, = {|aj^j — 1|, for every i and |ajj| < rj for every j ^ i}. 

For a point x G T'^ and a translation frequency vector a G T'^ we denote 
^ = (m, M, a, x) and define the following discrepancy function 

D(^, N) = i^{l<m<N ■{x + ma) mod 1 G MCu] - 2'^ {Jlim) N. 
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Fix d segements such that < Vi < Wi < l/2Vi = 1, . . . ,d. Let 

X = (m, a, X, {ttij)) G [f 1, wi] X . . . [vd, Wd] X T^'^ x G^. We denote by A 
the normahzed restriction of the Lebesgue x Haar measure on X. 



Theorem 1. Let p = (f j^*^^^ • For any 2; G M we have 

where C is the standard Cauchy cumulative distribution function 
^, , 1 r 1 tan-^z 1 

As will be clear from the proof, the same statement holds if A is re- 
placed by any probability measure on X with smooth density. Actually, 
we could replace the two perturbations of the box, the fluctuation of 
the sides' lengths and the application of an SL{d, M) matrix, by a single 
random linear perturbation, or by rM with r smoothly distributed in 
a neighborhood of 1 and M E Gn- We prefer to keep the perturbations 
split because their roles in the proof are quite different. 

Our proof of Theorem [T] shows that for typical a a quenched limit 
(that is, with fixed a and x uniformly distributed on T*^) of Dc{x, a, N) 
does not exist even if we would allow the normalizing sequence Un to 
depend on a. The reason is that the main contribution to the discrep- 
ancy comes from a small set of so called small denominators and, at 
different scales, different small denominators become important. Also, 
the number of the small denominators of a given size fiuctautes. There- 
fore there is a sequence of times when the descripancy is dominated by 
a single small denominator, so, after a proper normalization we get 
limiting distribution of compact support. On the other hand, we can 
consider a sequnce of times when there are many small denominators 
of approximately equal strength, in which case the limiting distribu- 
tion will be Gaussian. Since we can obtain different limit distributions 
along different sequences, no limit exists as — )■ 00. We note that the 
absence of quenched limits is often observed in zero entropy systems 

mum- 

1.2. Plan of the paper. We now give a description of the paper's 
content and of the main ingredients in the proofs. 

Section |2] contains preliminaries and reminders. In Section 2A_ 



we 



recall the representation of the Cauchy distribution in terms of a Pois- 



son process. In section 2^ we present Rogers formulas that allow to 
compute the average and higher moments for the number of points of 
a random lattice in a given domain. 
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In Section [3} harmonic analysis of the discrepancy's Fourier series 
allows to bound the frequencies that have essential contributions to 
the discrepancy and show that they must be resonant with a. After 
eliminating a small measure set of vectors a, for which the resonances 
are too strong we obtain that the good normalization for the discrep- 
ancy is (IniV)'^. The main result of Sectionjsjis Theorem [s] that reduces 
the proof of the main theorem to establishing a Poisson limit theo- 
rem for the distribution of small denominators and the corresponding 
numerators. 

Apart from section [3} all our proofs are identical in any dimension 
and in the 2-dimensional case. We therefore present the proof of the 
Poisson limit theorem in dimension 2 in the aim of improving the read- 
ability of the paper. Thus in sections |4]-[6] we assume that c? = 2 and use 
the notations {x,y), {u,v), instead of (xi, 0:2), ^2), (^i, ^2) 

and 

Ma^b^a2b2 ^Gr, = { - 1|, I62 - 1|, \a2\, |&l| < V, (^ih " ^102 = !}• 

In dimension two we need to prove the Poisson limit theorem for the 
sequence 

{(In^ A^(aifc + hil){a2k + h2l)\\ka + ll3\\,N{ka + 113) mod (2), {{aik + hi 

{{a2k + h2l)v},{kx + ly})} W 

when (fc, /) range over the resonant frequencies for (a, /3) that con- 
tribute to the discrepancy D{^,N), namely {k,l) such that 

|aiA; + > 1, \a2k + b2l\ > I, \{aik + bil){a2k + b2l)\ < N, 

(flifc -|- hil){a2k + h2l)^ka + l(3\\ < 2 — . 

e\n N 

In section 111 we reduce the Poisson limit of the first two coordinates of 



(t^t) to the Poisson limit theorem (Theorem 10) for the number of visits 



to a cusp by orbits of the Cartan action on M = S L?jiW) / S L'^^'L) . 



The proof of Theorem 10 is given in Section[6] Poisson limit theorems 
for dynamical systems is a popular subject. The most relevant for our 
purposes is paper [1] where a Poisson Limit Theorem is proven for 
partially hyperbolic systems assuming that the images of local unstable 
manifolds became equidistributed at sufficiently fast rate. 

In the present setting there are two new difficulties. First, the ge- 
ometry of the cusp is quite complicated (especially for large c?), in the 
sense that we do not know which k and / contribute to the resonances 
in {-k). However Rogers identities provide sufficiently strong control to 
handle this issue. Secondly, in the higher rank case (we need to con- 
sider the action of the full diagonal subgroup of S'L3(]R) because, for 
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a typical resonance, aik + 62^ and + 62^ have very different sizes) 
there is no notion of "unstable manifold" because there is no notions 
of "future" and "past" and going to infinity in different Weyl chambers 
gives different expanding and contracting directions. In the present 
setting, we are able to prove a Poisson limit theorem using the fact the 
long leaves of the Lyapunov foliations become uniformly distributed at 
a polynomial rate, except, possibly, for a small measure set. 

The relevant equidistribution results for unipotent subgroups of S'L3(]R) 
acting on the space of three dimensional lattices are presented in Sec- 
tion [H 

Unfortunately, the possible existence of small exceptional sets, re- 
quires us to introduce additional parameters in the form of small affine 
deformations of the box. This enables to deal with a Poisson Limit 
Theorem for lattices having a smooth density on M whereas if we 
work with the straight boxes we would have to establish a Poisson 
Limit Theorem for lattices having a smooth distribution on a positive 
codimension sub manifold of M. 

Finally, to prove the Poisson limit for all components of {-k), we need 
to show that the remaining components are asymptotically independent 
of the first two. This requires an extra work but the argument is similar 
to the original analysis of Kesten. 

In section [7] we discuss the discrepancy for the number of visits to 
boxes of small size N~'^, 7 < 1/d, and we obtain a similar result to the 
case 7 = that corresponds to the main theorem [l} The case 7 = l/rf 
was studied in [13] where a limit distribution was obtained without any 
normalization. As for the case 7 > l/d, it is vacuous since most orbits 
do not visit a ball of size before time (by the Borel Cantelli 
Lemma) . 

Finally, in Section [8] we discuss the continuous time case, that is, we 
study the discrepancies corresponding to linear flows on the torus. We 
show that in case of boxes the discrepancy is bounded in probability 
since the indicator function of a box is a coboundary with probability 
one. We actually get convergence in distribution of the discrepancies 
without any normalization. However, our proof of Theorem [T] implies 
a Cauchy limit theorem for continuous discrepancies relative to balls, 
and this only in dimension d = 3. Indeed, the latter is in sharp contrast 
with the higher dimension case obtained in [5j that states that for d > 4 
the continuous discrepancies relative to balls converge in distribution 
after normalization by a factor T^'^^^^^'^^'^^^K 



2. Preliminaries. 
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2.1. Poisson process. The proofs of the facts hsted below can be 
found in monographs [171 HE] . 

Let {X, jj) be a measure space. Recall that a Poisson process asso- 
ciated to (X, /i) is a point process with values in X such that 

(a) if N{A) is the number of points in A (Z X then N{A) has Poisson 
distribution with parameter ^{A) and 

(b) if Ai, ^2 . . . Afe are disjoint subsets of X then N{Ai),N{A2) . . . N{Ak) 
are mutually independent. 

If X C M*^ and /i has a density / with respect to the Lebesgue 
measure we say that / is the intensity of the Poisson process. 

Lemma 2. 

(a) If {Qj} is a Poisson process on X and ip : X X is a measurable 
map then Qj = is a Poisson process. If X = X = 'R and if {Qj} 
has intensity f and if ip is invertible, then the intensity of B is 

(b) Let (BjjPj) be a point process on X x Z such that {Qj} is a 
Poisson process on X and {Tj} are Z -valued random variables which 
are i.i.d. and independent of {Qk} then {Qj,Tj) is a Poisson process 
on X X Z. 

(c) Conversely if (Gj , Vj) is a Poisson process on XxZ with measure 
HXu where v is a probability measure then Qj is a Poisson process with 
measure /i and Tj are iid independent of Qs and having distribution v. 

(d) If in (b) X = Z = M. then 6 = {PjOj} is a Poisson process. If 
{Qj} has intensity f then has intensity 

ho) = ^v[f{p ^ 

Lemma 3. 

(a) If {Qj} is a Poisson process on M with intensity c6~'^ then 

lim- V e, 

^ &<\Bi\ 

has a standard Cauchy distribution, with p = en. 

(b) If {Oj} is a Poisson process on M with constant intensity c and 
if Tj are iid random variables having a symmetric distribution with 
compact support then 

hm - -f 

^ |ej|<M ^ 

has a standard Cauchy distribution with p = cE(|r|)7r. 
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The proof of Lemma [s] (b) follows from Lemma [s] (a) and parts (a) 
and (d) of Lemma |2j 



2.2. Rogers identities. The following identities (see [I3l|20]) play an 
important role in our argument. Denote 



c,=C{d + l)-\ C2 = C(c? + l)"', where C(c?+1) = J]^" 



n=l 

is the Riemann zeta function. 



Lemma 4. Let /, /i,/2 be piecewise smooth functions with compact 
support on Mf^^^. For a lattice C C M'^^^ let 

F{C)= /(^)' ^(^)= E /i(ei)/2(e2). 

e££, prime eiy^±e2S:C, prime 



(a) / F{C)dfi{C) = Ci f{x)dx, 



Then 



(b) / F(£)d/x(L) = C2 / h{x)dx f2{x)dx. 
(c) Consequantly 

[ F\C)dfx{C) = ci / f{x)dx+ci [ f{x)f{-x)dx+C2 ( 

JM JR'i+i jRd+i \ 

3. Estimating the contribution of non-resonant terms. 



/( 



The goal of this section is to reduce of the proof of the main Theo- 
rem [T] to proving a Poisson limit distribution of a point process related 
to the resonant terms to the discrepancy function (Theorem |5]). The 
subsequent Sections |4]-[6] will be dedicated to the proof of Theorem [5] 

3.1. Recall the definition 

X = {{u, a, X, (aij)) G [vi, wi] x . . . [vd, Wd] x T^"' x G,^} 
Let $„(w) := SM?ZE!!M^ For ^ G X and k E %\ we use the notation 

(3) ki = ai^iki H h ai^dkd 

Let 

U,{i,N) = An,$^^(.x,) '^^^''^[^'"|^ cos(27r(A;,x) + 
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where ifk,a,N = 7r{N - l)(/c,a)/2, A = ^ and {k,x) := Yfi=iki^i- 
Writing the Fourier series of the characteristic function of a box we get 
that 

/c6Z'*-{0} A;eZ'*-{0},fei>0 

3.2. Let 

\ki\<N,ky^{0,...,Q) 

We claim that there exists a constant C such that 

\\D-Di\\l < C 

where the norm refers by default to functions of the variables (a, x) G 
T^"^. As a consequence we can replace D by Z^i in (|2]). 

Proof of the claim. Assume ^ & X given. Then for any q > N and any 
qi, . . . , qd-i G N there exists only finitely many G Z'' such that ki^ = q 
and ki- G [qj, qj + i] for every j G [1, (i— 1], where ij is some permutation 
of the indices 1, . . . ,d. Since for any u, |$,„(u;)| < min(27r|c(;|, 1/m), the 
contributions of the latter frequencies can thus be bounded as follows 

\\n-n\\'^<r V ^ f ( 

q>N,qi,...,qd-i>0^ ' W ^ ' J 

3.3. Define S{^,N) = S{{aij),N) := {A; G : \ki\ < N,\ki\ > !}• 
Then let 

kes 

We want to show that it is possible to replace the study of Di by that 
of D2. For a fixed matrix (ajj), we want to bound the contributions 
of frequencies k such that ki^ < 1 for at least one index G [l,d]. 
Observe first that since (ciij) is close to Identity then ki < 2N for 
every i. Moreover, there exists C{d) such that for every (gi, . . . , g^^i) G 
[0, 2A^]'^-^ there is at most C{d) vectors k G [-A^, A^]'^ such that \kij < 
1 and \ki.\ G [qj,qj + 1] for every j G — 1], where ij is some 
permutation of the indices l,...,d. We call Kq^^ ^^q^_^ the latter set 
of k. We then exclude the translation vectors a for which there exists 
(gi, . . . , G [0,2A^]'^~^ with at least one k G Kq^^,,,^q^_^ satisfying 
\^izl{qi + < e/(lnA^)'^~^ The excluded set E^((aij)) has 

Lebesgue measure of order e. 
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We claim that 

WD D\? _ / (IniV)^(^-^) 

11^2 - ^l\\L2{{Ti-Et,)xTd) - ^ I ^ 

Therefore we can replace Di by in ([2]). 
Proof of the claim. Let 

Bp {{qi, . . ■,qd-i), {ai,j)) = {a e T"^ : 3k e ((flij)), 

pe/{\nNY-' < |nti'(g, + l)|||(fc,a)|| < {p + l)e/ {In N)^-'} 

then Leb {Bp {{qi, qa-i), {aij))) < C (g,+i)...fa_,%i)(injv)''-i • Hence 

^ V V {\nNf('^-^-> 

^ ^ (gi + 1) . . . {qd-x + 1) (In NY"^ e V 

^^(lnAO^_ ^ 

~ e 

3.4. Let A^) := {k : nf^i|A;i| < iV, \ki\ > 1}. Then let 

Ds{^,N) = Y,m,N). 

We claim that there exists a constant C > such that 

11/^3 -/^2|l2<C(lniV)^-i. 
Therefore we can replace D2 by D3 in ([2]). 
Proof of the claim. Denote K{k) = Uf^^ki. We have that 



N 



, ^ ^, K{kY Jjd\ sin(7r(/c, a)) 



1 /■ /'sin(7rA^(A;,a)) 

)\>N 



2 



k&S,\K(k)\>N ^ ' 

Let 

A, = {k^S: \K{k)\ G [2^Ar,2^+^Ar]} 
and observe that < C2"A^(ln + sY~^ . Thus 

°° TV 
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3.5. Define T(^, N) = T{{aij), a, N) by 

:= j/c G 5((a„),iV) : \UUk\\\ik, a)\\ < ^^y^} 

and let 

We claim that 

therefore to prove ([2]), we can study the limiting distribution as — cxd 
of Di/(\nNY. 

Proof of the claim. Since k & S and (aij) is close to Identity we have 
that 1 < \ki\ < 2N for every i. Now, for every qi,...,qd G [1,2A^]'^ 
there are at most C{d) vectors k G [—N, such that | fci] G [qi, qi + l]. 
We denote the latter set of vectors K{qi, . . . ,qd). We have that 

{qi,...,qdMl,2N]'i 

where 

^^^^^'■■•'"^ ^ ,,^_Jt^ (ntig.||(fc,«)||)2'^nt,..||(.,.)||>iMin^)^^«- 

Consider for each A; G v4x(gi,...,grf) and p G N the sets 



= a G r : Vrnd ^ ntig.|| (fc, a) || < 



We have that Lehj^Bk^p) < 1 / {eUf^-^qi{\n N^) . Thus 

1 f. e^(lniV)^'^ ^ (IniV)'^ 

- ^^nt,g,(lniV)'^ ^ p2 - ntig, 

and the claim follows as we sum over (gi, . . . , g^) G [1, 2N]'^. □ 

3.6. Let 5 = e\ Define A^) = W{{aij), a, A^) by 

W{^,N) := {A; G : |nti^i| < N'^~\ 

yt = i,...,d, > i,|nti^i|||(A;,«)|| < ^ 



and let 
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Let 

FN{{aij)) = EN[j{a eT'^ ■.3k eT such that 

\Utiki\\\{k,a)\\ < 



(In NY 



Since for any qi,...,qd G [1,2A^]'^ there are at most C{d) vectors 
k E S such that ki G [qi, qi + 1] we get that 



Ce 

rin A 

l<qi,...,qd<N 

We claim that 



Ce 

Leb,. (F^) < Leb,. {E^) + . ^^.^-^ ^ < Ce. 



Therefore to prove ^ we can study the hmiting distribution as — )■ oo 
of Ds/QnAT)^. 

Proof of the claim. For every k G Z'^, we have that 

Leb ja G T*^ : |nti^i|||(A;,a)|| < — ^— I < ^ 



eOnAT)"! J - |ntiA;i|e(lnA^)'^' 

Hence the contribution of the A; G T — 14^ for a G T'^ — Fn can be 
bounded by 



I-D5 -D4|lL2((ird-Fjv)xT'*) — 



^ (lnA^)2'^ 1 



For s = 0, ...,[51nA^], define 

P, = {keT : |ntiA;i| G [2'N^-^,2'+^N^-^]}. 

Then #(Ps) < C2"A^i-'^(ln A^)'='-^ Thus the terms in contribute to 
II-D5 - ^4||L2((T2_Fiv)xT2) with less than 

(InAT)'^ ^^ (InAr)^'^-^ 
Summing over s = 0, . . . , [51n A^] we get the required estimate. □ 
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3.7. Observe that that given e for each r/ > there is a number 72(77) 
such that 

mes(^ such that ij^{W{i,N)) > n{ri)) < rj 

uniformly in N. Since the contributing terms in satisfy ||(/c,a;)|| < 
eiinNy ' replace t/fc in the definition of A by 

and consider instead of in ^ 

fceVK,fci>o 



Next, we let 



N) = {ke W{i, N),ki>Q:^meZ such that A;i A . . . A k(i /\ m — \ 

and II (A;, a) || = (A;, a) + 771} 



and we can replace the study of j^^^yi by 
where 

0fc(e,iV) = nti^.||(A:,a)||(lniV)^ 

rfc(^, N) = (f) {hui, • • • , kdUd, N{k, a), (fc, x) + (fk,a,N) , 

and 
(4) 

, . . [^f=i sm{27ijrii)] sm{njrid+i) cos{2njria+2] 

0(7/1, ...,T]d, T]d+l,Vd+2) = 

Note that 115^^011 < C, for any i = 1, . . . ,d + 2. The difference 
between D7 and Dq is that for k E Z,we comprise in D-j all its multiples 
whereas in Dq we take only multiples such that jk G W. This does not 
make any difference in the limit because ^ < 00 and because we 

can of course add to Dq the multiples of k such that \\j{k, a)\\ < ^if^^^^y 
with e' e which accounts for most of the sum in 6. 
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3.8. By the general facts about Poisson processes hsted in section 2A_ 
Theorem [l] follows from the next result 

Theorem 5. For any 6,6 > we have that as N ^ oo and ^ E X is 
distributed according to the normalized Lebesgue measure X, the process 

{ {{In NyUiki\\{k, a)\\,N{k, a) mod (2), {hui}, . . . , {kdUd}, 

{{k,x) +ipk,a,N})}keZi^,N) 

converges to a Poisson process on [—-,-] x M/(2Z) xT'^+i with intensity 
2'^^^{1 —SYci/dl where Ci = l/({d+l) is the constant from Lemma^ 

Here and below when we consider the Poisson process on a real line 
times a torus the intensity is always computed with respect to the 
Lebesgue measure on the lines times the Haar measure on the torus 
(that is the measure of the whole torus is equal to one). This normal- 
ization is convenient since in Lemma [2] we need to have a probability 
measure on the second factor. 

Note that by standard properties of weak convergence the result 
remains valid for e = 6 = 0. That is, we get the following result which 
is of independent interest 

Corollary 6. Let ^ & X be ditributed according to the normalized 
Lebesgue measure A. Then as N ^ oo the point process 

{ ((In Ar)'^n,^,|| (A;, a)\\,N{k, a)mod(2), {^i^i}, . . . , {kdUd}, 

{{k,x) +ifk,a,N})}keZ'(i,N) 

where 



Z*{^, N) = {keZ'^ : \k,\ > 1, \Uiki\ < N,ki> 0, 
\m\\ik,a)\\<^^^, 
3m E Z such that ki A . . . A k^ A m = 1 and || (A;, a) || = {k, a) + m} 



converges to a Poisson process on M x M/(2Z) X T^+i with intensity 
2'^-^Ci/d\. 



3.9. Proof of Theorem [T| We have that (a) and (c) of Lemma |2] 
imply that {Qk{^, ^)}fcGZ converges to a Poisson process on [— ^, ^] and 
that {rfc(^, N)}kfzz are asymptotically iid independent from the 9s and 
have a symmetric distribution with compact support. Hence Lemma 
|3] applies and yields Theorem [TJ Note that the limiting distribution 
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of Dj/{\nNY has form -^^^ Ylij Observe that due to Q we have 



E(r) = C{d + 1) [lY^^ and hence 



1 1 /2^'''+2 



3.10. The case d = 2. Notations. Since the proof of Theorem |5] is 
the same for general d as for the case d = 2, we specify in the sequel 
to the latter case. In our opinion, this will improve the readability of 
the proof to which sections |4]-[6] are devoted. There, we will prove the 
following version of Theorem |5] in the case d = 2. 

Recall from the introduction the notations {x,y), {u,v), {a, (3) in- 
stead of x,u,a and 



^aibiaafca ^ ^r, = { jfli - 1|, |fe2 " l|, \a2\, |&l| < V, ^1^2 " ^102 = l}- 

Let 

X = {{{u, v), («, P), {x, y), (a,,,-)) e [-u, u] x [~v, v] x x x 
We denote by A the normalized Lebesgue measure on X. 

Theorem 7. For each 6,6 > the following holds. As N ^ oo and 
C, E X is ditributed according to X, the process 

{(in^ A^(aiA; + bil){a2k + b2l)\\ka + ll3\\,N{ka + l(5)mod{2), {{ayk + 

{{a2k + b2l)v], {kx + ly + Vi,k,c>,^,N})}{k,i)ez(i;,N) 

converges to a Poisson process on [— ^, ^] x with intensity (1 — 6)^Ci 
where Ci = 1/C(3) is the constant from Lemma^ Here 

Z{^,N) = {{k,l) G : |aiA; + 6i/| > 1, laafc + foa^l > 1, 



\{aik + bil){a2k + 62/)! < N^'\aik + bil > 0, 
\{aik + bil){a2k + b2l)\\\ka + lf3\\ < ^.x^ , 
3m E Z such that k A I A m = 1 and ||/ca; + = ka + 1(3 + m} . 



4. Reduction to dynamics on the space of lattices. 

Denote M = [InN]. 

Introduce the following notations 



;i,e], J= [-e,-l)U(l,e], K 



1 1 
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fx) 






)l 


y 








\^ 1 




[ml 



e-*i 

(5) gt,M = I e-*^ 

e*i+*2 

ai bi 
A = I a2 62 

a (3 1 

Define on the space M of unimodular lattices C tlie function 

(6) $(£)= h{x{e))lj{y{e))lK{xyz{e)). 

edC prime 

and on M X M define an M x T valued function 

(7) m{C,h) = {m,{C),^!2{C,h)) = 

J2 li{x{e))lj{y{e))lK{xyz{e)){M^xyz{e),bz{e) mod (2)). 

eG£ prime 

Given N, suppose tliat ^ G X is sucli tliat for every ti,t2 G [0,M] 
tliere exists at most one {k,l) G Z{C,, N) sucli tliat 

(8) e*^ <Li< e*^+\ e*^ < IL2I < 6*^+^ 

where Li = ayk + 61/, L2 = + &2^- Note that if ([s]) holds then 
(/c, /) G Z iff t2^(,^)) = 1. Thus, for such ^ we have that the 
sequence 

{M^{aik + bil){a2k + foaOII"^ + /3^||,X(fc« + ll3)mod{2)}k^i^z 
is exactly 

{v|/(M(0,A^e-(*^+*^))},en,*(,.A(e))=i 

with 

n = {t = (ti, ^2) G : ti + t2 < (1 - 5)M}. 

Hence, to show that the distribution of {M^{aik+bil){a2k+b2l)\\ka+ 
lf3\\, N{ka + 1(5) mod {2)}k^i^z converges as X — )■ 00 to that of a Pois- 
son process on [—-,-] x M/(2Z) with intensity 2(1 -5) it is sufficient 
to have (a) and (6) of the following theorem. 

Before we state the theorem we introduce the following straightfor- 
ward notion of splitness between the coordinates of a sequence of points 
in n. 

Definition 4.1. Let A > 0. Consider a sequence . . . , t*^*-*} of 
points in 11 where t^^^ = {ti\t2^). We say that this sequence is is 
74-split if for any pair i,j we have 

> A |4'^-4'^| > A and I max(tf\t?)-max(4^'\4'))| > A, 
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and for any i we have t^^^ > A, "* > ^• 
Let 

n = {t : ti > 0, t2 > 0, ti + 12 < 1 - (5} 

n = {t : ti > 0, ta > 0, ti + ta < M(l - 5)} 

Theorem 8. Assume that ^ G X distributed according to a probabil- 
ity measure with smooth density with respect to the Lebesgue measure. 
We will denote by A the matrix A(^) as defined in Then 

(a) For any t e U, P($(^*A) > 1) = C(M-^). 

(b) {\I/((7*A, Ne~'^^'^^*'^'>)}is>(^gt/^)=i^i^Yi converges as N ^ oo to the Pois- 
son process on [—\, \] x ]R/(2Z) with intensity (1 — 5)^Ci. 

(c) Let t^^\t^'^\ . . . G n 6e the points such that ^{gtA) = 1, listed in 
any order. For any s ^N, we have that 

P({t(^), . . . , t^'^ is Vm - split) ^ I as N ^ oo. 

In order to get the full Poissonean limit in Theorem [7] we will also 
need the following fact. 

Theorem 9. Let K > be fixed. Given any s G N, let , /f ), . . . , (A;f , /f ) 
be a sequence of s 2-tuples such that the sequence 

t^^^ = ([In \aikj + biljW, [In \a2kj + 62/jl]) 

is \/M -split ([■■■] denotes the integer part). 

Suppose that {u, v, x, y) are distributed according to a density such 
that 

(9) \\pN\y<K. 
Then the distribution of the s 3-tuples 

{{aiki + bili)u}, {{a2ki + 62/1)1;}, {kix + hy + (j)i^,ki,a,i3,N} 

{{aiks + bils)u}, {{a2ks + b2ls)v}, {kgX + hy + (f>h,ks,a,^,N} 

converges to the uniform distribution on T^'^ and the convergence is 
uniform with respect to N and (ai, 02, &i, 62, a, and to the choices of 
s 2-tuples satisfying the splitness condition and pN satisfying ([9]). 

Proof of Theorem We need to show if fj : T'^ — ?■ C are exponentials 
fj{di, 02, 03) = exp I 27Ti ^ rrijkOk 

\ k=l 



18 



DMITRY DOLGOPYAT AND BASSAM FAYAD 



and not all mjk are equal to zero then 
(10) 

/ IT fj (('^i^j + bilj)u, {a2kj + b2lj)v, {kjx + Ijy + 0/,,fc,,a,/j,Ar)) 

Pn{u, V, X, y)dudvdxdy — )■ 0. 

uniformly in the parameters involved. Note that if not all iriji are 
equal to zero then the coefficient in front of u in the above product 
is large since, due to splitness, it is dominated by the contribution 
of the largest of \aikj + hilj\. In this case we show that the integral 



(10) is small by integrating by parts with respect to u. Similarly if 



not all mj2 are zero we can show that the integral (10) is small by 
integrating by parts with respect to v. Finally, suppose that all rriji 

and all mj2 are zero. Let j be such that max(t^"''*, ^2"'^) is the largest 
among those indices for which m^s ^ 0. Note that either k-j or (or 

both) is of order exp(max(t["'\ 4"'^))- case > we have that kj 
dominates the coefficient in front of x and so we conclude that (10) is 
small by integrating by parts with respect to x. In the opposite case 
when \k-j\ < we conclude that (10) is small by integrating by parts 



with respect to y. □ 

Proof that Theorem ^implies Theorem As mentioned before, (a) 
and (b) of Theorem^ imply that {M\aik + 61/) (02^ + h2l)\\ak + 
l3l\\,N{ka + 1(3) mod 2}k,i&z converges as — )■ 00 to a Poisson pro- 
cess on [-|,|] X M/(2zj with intensity 2(1 - 5fci. Next, (c) of 
Theorem |8] implies that the hypothesis of Theorem |9] holds. Thus 
{ ({(aifc + bil)u}, {{a2k + b2l)v}, {kx + ly + </'i,fe,a,/3,Jv})T(fe,/)ez(e,7V) 
verge to uniformly distributed iid's on independent of {M^{aik + 
bil){a2k+b2l)\\(yk+/3l\\, N(ka + 1(3) mod (2)}^ ^^^(^ tv)- Lemma|2]hence 
yields the full Poissonean limit of Theorem [7j □ 
Before we close this section we use a last observation that allows 
us to complete the reduction of our problem to a clear cut dynamics 
problem on the space of lattices, namely the following. 

Theorem 10. Assume that C has a smooth density on M. Then 

(a) For any t e U, P($(c/*£) > 1) = C(M-^). 

(b) {^{g^'C, Ne~^^^^^^^)}ij>(gtc)=i,teu converges as N ^ 00 to a Pois- 
son process on [— ^, ^] x ]R/(2Z) with intensity (1 — 5)^Ci. 

(c) Let t^^\t^'^\ . . . Eli he the points such that ^(gtC) = 1, listed in 
any order. For any s eN, we have that 

P({t(^\ . . . , t^')} IS Vm - split) ^ I as N ^ 00. 
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Proof that Theorem [7^ implies Theorem \^ 

Let rj > and define for an interval A = [a,b] the intervals A'^ = 
[a{l — ri),b{l + rj)] and A~ = [a{l + ri),b{l — t])]. Fix some interval 
K C K. Let $^ be defined as in ^ with the intervals J^,^^ 
instead of /, J, K. Next, given A = A(^) for some ^ E X, define 

f (l + CTl) 

A = (1 + aa) 

V (1 + ai) 

where Ui, (T2, Ci and C2 are random variables such that (oi, 61, 02, &2, /3, cti, (T2, Ci) 
has a smooth distribution. The equivalence between Theorem [8] and 
Theorem 10 stems from the straightforward observation that if M is 
sufficiently large, then for any n G N it holds that 

*~(^ti+ln(l+ai),t2+ln{l+<T2)A) > n =^ ^{gt,,t2^)>n 

=^ '^'^(5'ti+ln(l+(7i),t2+ln(l+<72)A) > n 

□ 



The rest of the paper will be devoted to the proof of Theorem 10 



5. Rate of equi-distirbution of unipotent flows. 

In our argument we shall use the fact that the action of gti,t2 M 
is partially hyperbolic in the sense that 

3 

TM = Eo + J2 K ® 

g=l 

where Eq is tangent to the orbit of gt and are invariant one dimen- 
sional distributions. The corresponding Lyapunov exponents are ±Xq 
where 

Xl = 2ti + t2, X2 = ^1 + 2t2) X^ = ti — ^2- 

-E^ are tangent to foliations which are orbit foliations for groups 
/ig where 

/ioo\ /ioo\ /ioo\ 

h+{u) = 1 , h^{u) = 1 , h+{u) = I u 1 \ 
\u 1 J \0 u 1 J \001/ 

and h~{u) are transposes of hg{u). Below we shall abbreviate Wg = 

q 
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Definition 5.1. Given s, r > 0, we say that a function A : M 



is in H^''^ with ||A||s,i 
A- < A< A+ such that 

\\A+ -A- 



K if given < e < 1 there are if^-functions 



< e and \\A^\L < Ke' 



Definition 5.2. Fix > 0. Let L > and P be a partition of M into 
Wg-curves of length L and denote 7(0;) the element of V containing 
X. Given a finite or infinite sequence of integers {fcn} and a function 
A G if^'"", we say that V is KQ-representative with respect to {{kn},A) 
if for any n 



(11) 



fi\x: 



1 



- A 



< L: 



where A = Jj^ A{x)diJ,{x), ICa 



\A\\s,r + 1, and L„ 



length of the Wg curve g'^"'j{x) that goes through g''"{x). 
We call the points x such that 

1 



\/n 



A{s)ds - A 



representative with respect to (V, {kn}, A). Observe that if 

n 

then the set of representative points has measure larger than 1 — e. 
The goal of this section is to show the following. 

Proposition 11. There exists s, kq, > such that for any < r < s, 

< e < Eo, and any function A G H^ '^ , and any L and a sequence {kn} 
satisfying 

n 

then there exists a partition V of M into Wq-curves of length L that is 
KQ-representative with respect to ({A;„}, A). 

The requirement that r < s will only serve to maintain the exponent 
K in the speed of equidistribution in (11) bounded from below. Any 
upper bound on r would yield a lower bound on n but it will be sufficient 
for us in the sequel to consider functions in H^'^, since we will have to 
deal with characterisitc functions of nice sets (cf. section 6.3). 

Proof. Without loss of generality we will work with functions A having 
zero average, that is A = 0. We will first prove proposition 11 for 
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A G and then generalize it to A G H^''^. Also, we will give the proof 
for the case g = 3 the other cases being similar. 

By [l2], Theorem 2.4.5 there exists s and constants C, c > such 
that a A,B e and if 



e*i 
e*2 
^0 e*3 

where ti + ^2 + ^3 = then 

(12) |/i(A(x)i?((7x))-/i(^)Mi?)l <CPIIs||fi||se-'='"'^^l*^l. 

We claim that this implies that there exists C > and n > such that 

(13) \fi{A{x)B{h:,{u)x)) - fi{A)fi{B)\ < C\\AMB\Uu-\ 
Indeed let 6 be such that tan^ = e~^* and let 



where 



U{t) = R{9) 



m 



' 

e* I R{- 
1 

cos 9 sin 9 

— sin 9 cos 9 

1 



A simple computation gives that U{t) = h^iu) with u = e + o(l), 
hence (13) follows from (12). Now, assuming that /^(A) = 0, (13) 
implies that 

\^i{A{x)A{h^{u)x))\ < ClC\u-'' 
with Ka = 1 1^1 Is, thus for Sl{x) = f A{h^{u)x)du we have 

M5l) = 0, fiiSl)<CL-^}Cl. 

This implies that 

(14) /i(x : \Sl{x)\ > JCaL-^/^) < CL-^/\ 

Next let V be an arbitrary partition of M into W^-cnrves of length 
L and let = h3{Lu)V. Then by ^ 

1 
L 



x,u) 



A{s)ds 



< ICaL' 



< CL- 



where /i denotes the product of /i and the Lebesgue measure on [0, 1] 
and kq = k/3. Thus we can choose u so that "P" satisfies 

1 



(15) 



/X X 



A{s)ds 



7(x) 



> ]CaL- 



< CL- 
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If L is large we can drop the constant C if we let be slightly 
smaller than k/3. Likewise, if {kn} is a finite or infinite sequence with 

n 

then there exists a partition V that is representative with respect to 



({/c„},y4) as in definition 5.2 



To extend (15) to functions in H^'^ (that eventually have infinite 



if^-norm), we use a standard approximation argument. Note first that 



( 14 ) still holds for non zero mean if^-functions if we let 



St{^)= ( I / A{h^{u)x)du]-ii{A). 



On the other hand, we have that < < e, hence if JCa = 1 1^| |s,r + 1 
we have that 

So, if we choose e and k such that e = )CaL~'^ ~ )Cas^^L^'^° , that is 
e ~ L~'^ and k = K,o/{r + 1) we get that 

Using to bound {^Sf^{x) < — we see that (15) and thus 
the rest of the proof extends to H^'^ functions, provided the exponent 
Ko is reduced. □ 

6. PoissoN Limit Theorem in the space of lattices. 
6.1. Multiple solutions. 

Lemma 12. Assume that C has a smooth density on M. Let $ he 
defined as in M). Denote $* = $ o (7^. Then we have as M 00 and 
for any t,t' - {0, 0} 

(a) E($*) = C(M~2). 

(a) E(<l>*) = Ci-M-^ + C»(M-^°°) if min(ti,t2) > 

(6) E((<l>*)^ - $*) = 0{M-^) and hence P(<l>*(£) > 1) = 0{M-^) 

(c) P(<l>*(£) ^ and <^\gt'C) ^ 0) = 0(M-^). 

Proof. Without loss of generality, we can assume in the proof of the 
inequalities (a), (b), (c), that £ is distributed according to the Haar 
measure on M, and by invariance of the Haar measure take t = 0. The 
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inequalities then follow from Roger's equalities of Lemma |4j Indeed, 
part (a) of Lemma |4] implies that E($) = Ci|M~^, since 

4 

li{x)lj{y)lK{xyz)dxdydz = -M~^. 

On the other hand, if for e = (x, y, z) G £, we let /(e) = 1/x jxE'la^, xyz), 
then since / is an interval of positive numbers, we have that 

ei^e2&L prime eij^±e2&L prime 

and the first estimate of part (b) follows by Lemma |4]^b). The second 
estimate follows from the first by Markov inequality. As for (c) observe 
that if we define, for e = {x,y,z) e C, g{e) = l^.*/ ^^^^z+t/ ^(x, xyz), 
then 

E($$*') = / Y f{ei)9{e2)dfi{C) 

Jm /I 

e2^±e-i(iL prime 

where the contribution of 62 = — ci vanishes because both I and e~*'iJ 
are positive intervals, while the contribution of 62 = ei vanishes since 
either / and e~*'i/ or J and e~^'^J are disjoint. Applying Lemma |4](b) 
we get (c). Since in the case £ is distributed according to the Haar 
measure we have that E($) = Ci|M~^, then (a') follows for a smooth 
density by exponential mixing of the geodesic flow. □ 



Proof of Theorem a ) . Part (a) of Theorem 10 is exactly the second 



statement of part (b) of Lemma 12 □ 



6.2. Poisson limit distribution for the visits to the cusp. In 

this section and in sections 6.3, 6.4 and 6.5, we will prove the following 
result. 

Theorem 13. {'^i{gtC),'ti/M,t2/M}^(g^c)=i,t<^n converges to the Pois- 
son process on x H with intensity 2ci. 

Combining Theorem 13 with Lemma |2](a) we see that the first com- 
ponent {^i{gtX,)}^(gtC)=i converges to a Poisson process with intensity 
2ciArea(n) = Ci(l — 5)^. This result will be extended in section 6.6 to 
yield Theorem [lOj 



Proof. Similarly to the function $ defined in (|6]), introduce for the rest 
of the paper the following shorthand notation. Divide K = [—j^, 
into a finite number of intervals Ki , K2 ■ ■ ■ and let 

(16) ^t = HgtC), ^t,p = %{9tC) 

where $p is defined by ^ with K replaced by Kp. 
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Also let $ be defined by (|6| with K replaced hj K = {z : d{z, dK) < 
M~^^^}. Last, consider the following collection of functions 

$ = {$i...<l>^,<l>?...$^,l>}. 

Pick a small number 6 <^ 6. Divide n into strips parallel to KerAi 
of width 6M. We call these strips Hj. Then, further subdivide these 
strips into squares Ci, . . . , Ch of side size 6M. The part of n that is 
left over has negligible measure. 

Fix k G Z+. Pick k squares Si, S2 ■ ■ ■ Sk C {Ci, . . . , Ch} with centers 
We call the square configuration 5-generic if their images under 
Ai are distant by more than 36M. Also fix an index iq E {1 . . . mi} for 
each 1 < q < k. 



To obtain Theorem 13 , we shall prove 



Lemma 14. 

(a) F{3t',t" eU:^t'= 6" = 1 and |Ai(t') - Ai(t")| < 3SM) ^ as 

(h) If Si, S2 . . . Sk is generic then 
F{^t = forte U\[j Sq and 3t^'^^ E Sg : = 1 while = for t E Sg 

{2cirS'' liTj j exp (-^Siilzi)!^ (1 + ,-^^(1)). 



Proof that Lemma I4 implies Theorem 13_. Divide 11 into subsets 
Vi,V2- ■ ■Vm2- Suppose that we want to find the probability that for 
each (p, s) G [1, . . . ,mi] x [1, . . . ,^2], there are Ip^s points, satisfying 

^EVs,-^i{gtC-)eKp. 



We will apply Lemma 14 with k = ^p^i mi 5=1 m2 ^p.«- -^^^ each 



s, there are n. 



^''"^^2'^"'' squares in MVs- 



By Lemma 14 a), the contribution of non-generic choices of k squares 



is seen to be negligible as S — )■ 0. On the other hand by Lemma 
14rb), generic choices contribute (recall that Area(n) = (1 — S)"^, and 
E|A'.| = f) 



n 

p,s 



2ci\Kp\6'^ 



exp 



4ciArea(n) 



p,s '- 



{2ci\Kp\Area{V,)y^ 



exp(-2ci|irp|Area(P,)) 
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Proof of Lemma 14(o,)- By Bonferroni inequality 

P(3t',t" G n : ^t' = it" = 1 and |Ai(t') - Ai(t")| < 3SM) 



|Ai{t')-Ai(t")|<3<5A/ 



6" = 1) = 0(6) 



where the last inequality follows from Lemma 12 



□ 



Remark. Define 7m (i^) = M + ti. The same argument as in the proof 
of Lemma [l4|(a) actually yields that 



(17) F{3t',t" eU:it'= 6" = 1 and \v' - v"\ < 35M) ^ as 5 ^ 
where v' G {Ai(t'), 7A/(t')} and v" G {Ai(t"), 7A/(t")}- This will be 



useful in the proof of Theorem 10 that will be given in section 6.6 



Before we prove Lemma 14 in Section 6.4, we first prove a standard 
estimate on the Hg s norms of $, $p and ^. 



6.3. Estimates of norms. 

Lemma 15. For any s > we have that 



(a) ||$||,,s = 0(l), 

ll*'l|s,s = 0(l), I 

(b) E{<i>,) = 2c,\K, 



1$ 



1$ 



= 0(1), I 
0{1). 

E(<l>) = o(Af-i°°; 



1 1 1 s,s 
s,s 



$1 



Proof, (a) We shall prove the bound for $, the estimates of $j and $ 
being similar. Let be a C°° function such that (j){z) = 1 for z < 0, 
(j){z) = for z > 1 and < 0(2;) < 1 for < 2; < 1. Given an interval 
K = [ki,k2] let 



k: 



z 



e 

z - k2 + e 

e 



ki+ e 



e 

z — ki 

e 



Then $^ < $ < where $^ is defined by ([6| with /, J and K 
replaced by If , Jf and K'f , and $~ is defined by ^ with /, J and K 
replaced by /~, J~ and Kj . By Lemma |4|^a) ||$^ — < Ce and 

||'^*^||s < Ce~^. The norm of $^ is estimated similarly but Lemma[4]^c) 
has to be used in place of Lemma |4]^a). 

(b) The result follows directly from Lemma |4](a). □ 
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Figure 1. A generic square configuration and strips of 
type A, B and C. (Type C strips wliicfi need not lie in 11 



will be used in the proof of Theorem 10 (b). See subsec- 
tion 



6.6). 



6.4. Proof of Lemma |14Kb). 

Proof. Let {Sg}g=i^,,,^k be a fixed 5-generic configuration of squares and 
let {iq} be a sequence of indices with values in {1, ... , mi}. 

We call a strip Ilj which contains a square of {Sg} a type A strip. 
The remaining strips (they are a majority) are called type B strips. 
We recall that the 11^ are parallel to KerAi, that is lij = ^i^[(j-i,Cj]- 
These strips have common boundaries. To create some independence 
we let (j = Cj-i + VM and let Ilj = ^i^[CjXj]- Lemma 
that F{^t > 1) = 0(M-2), hence 



12 



a) implies 



p(3t G n \ U : > 1) = o{i/Vm). 



Accordingly we can concentrate on the contributions of t G Uj-nj. 

Similarly, we may assume that min(ti, ^2) > \/M. We keep the notation 
type A and type B for the reduced strips llj. 

If Tlj is of type B we say that it is compatible if = for all t G 11^ . If 
Tlj is of type A we say that it is compatible if for q such that Sg C 11^, 
there exists t G Sq such that ^t,ig = 1 and = for t G tlj — {t}. 
Denote Pq = 1, 

Pj = F{Yli are compatible for / < j). 
We shall show that if Ilj+i is of type A then 
(18) p,+, = 2c,\Kj'p,il + osil)) 
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and if Hj+i is of type B then 

/ 4ci 

(19) = pj il - ^Area(n,+i)(l + o(l)) 

with flj+i = Tlj+i/M and o(l) is a term that goes to as M — t- oo. 



Combining (|18 
shall prove 



and (19) for all j we obtain part (b) of Lemma 14 We 
18), (19) is similar. 
Let Vj be a sequence of increasing partitions of size L = [e''^ M^^^)^^ 
such that Vj is Ko-representative with respect to [t, $) for every t G 



Llj+i, where kq is given by Proposition 11 This is possible by the latter 
proposition since $ C H^'^ and 

J2 {Le'^^'Y''' <t:l. 



ten 



Given t G Uj+i we also take partitions Vj of size = (e^i(*)M^°°)"^ 
which are representative with respect to {{i G Hj+i : Ai(f) > Ai(i(:) + 
i?lnM}, $), where i? is chosen larger than 1000. We can assume that 
Vj refines Vj by adding the endpoints of Vj to Vj. Let Tj,T] denote 
the o algebras generated by Vj and respectively. For L G M, Tj{C-) 
denotes the element Tj G Vj that contains L. 

Observe that Ea6n,.„A.®>A,w+HinA./ (^^e^^^*"))-"" = O(M-ioo). Thus, 
if we let Ej be the set of L such that 

(Rl) £ is representative for (t,Vj, $) for every t G Hj+i, 

(R2) Except for a proportion (9(M~^°°) of the measure of J-'j{C), 

C G J^j{C) is (t, representative for every i G Hj+i such 

that Xi{i) > Xi{t) + R\nM, 
(R3) E{Vj+i\J^j) = 0(1/ VM) where Vj = Et,t'm,:\Xr{t)-xm\<RinAi ^t&^ 
then P(£;|) = C(ln M/v^M). Indeed (R3) is violated with probability 



C(lnM/v M) due to part (c) of Lemma 12 and Markov inequlity while 
the probability of violating (Rl) or (R2) is even small due to the results 
of Section [H 

Note that the set {IIi ■ ■ - Tlj are compatible} can be modified on a 
set of measure O (M"^") so that the new set is J-'j measurable. In- 
deed, if lni...n,- are compatible IS not coustaut ou J^j{C) then gt{Fj{C)), 
that is of length 0(M~^°°), passes in the 0{M~^^^) neighborhood of 
the boundary of the set defining $ for some t G IJj<j Hence 

(20) pj+i = P(ni . . . flj are compatible and flj+i is compatible) 
= E(lni...n,- are compatibieP(ni+i IS Compatible \Fj)) + o(M~^°). 
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Our goal for the rest of this section is to prove that for L G Ej we 
have 

(21) P(nj+i is compatible \ J=j) = 2ci\KiJ6^{l + 05^0(1)). 
We let 

(Note that, in fact, rjt = lgt=i, and rjt^p = 1^4^=5^=1 but we use a more 
complicated definition above to emphasize that rjt ^ ^t, rjt^p ~ ^t,p-) 

We then get the following 
Claim. For C G Ej and t,i e Uj+i with Ai(f) > Ai(t) + i?lnM 

(22) m.JJ'j) = ^-^^ + 0{M~') 

(23) E{C - ^tlJ",) = 0{M-') 

(24) HVtVtlJ'j) = 0{M-^) 



Proof of the claim. Because C is representative for (t,Pj,$), (23) 
follows from part (b) of Lemma 12 Now, since 

6,*, - (^L„ - 6,iJ < Vt,i, < ^t,i, and < r]tr]t' < 6, 



(22) also follows from Lemma 12 Equality (24) needs a little more 
work. Since ?7t < 1 and r^f < 1 we have 

HVtVtlJ'j) = E(E{r]tVt\J'!j)\J'j) 

< E(E(liE(,^|^*)=ir/f|J-;)|J;-) +P(0 < HVtlJ^j) < MJ'j)- 

Next, 

E(E(lK(,,|^|)=i77f|J-|)|J-,) = E(lE(,,|^*)=iE(r/f|-F|)|j;-)- 
Observe now that if C G J^j{C) is (t,Vj, $) representative, then 

HvtlJlic')) < E(6-|^l(/:')) = mt) + o{M-''') = 0{M-'). 

Due to (R2) we get that 

E(E(lE(,,|^*)=ir/,-|-^;)| J-,) < 0(M-2)E(l^(,,|^.)=i|J-,) + 0(M-io°) 

< 0{M-'^)E{r]t\J^j) + C(M-^°°) = 0{M~^). 

On the other hand if < E{r]t\T!j{C')) < 1 for some C G J^j{C), then 
z(c/ti:') is M-i°° close to the boundary of K, that is P(0 < E{r]t\T^) < 

l\J^j) < E(<l o g^\J^j) = C(M-i°°), the last bound coming from (Rl). 
The claim is proved. □ 
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Back to the proof of (21), we have that 



P(3t G n,+i : 6 > 1| J-,) < Yl - 



Let 



/ = P(3t es,n n,+i : 6,^, = i, and 6- = for t> t J G n,+i| J^) 

J = P(3t G n Uj+i : r/i,i^ = 1, and 7]t = for t ^ t, t G IIj+il J^) 



Then, due to (|23D, 

(25) 



0</_j< J2 ^i^t - ^tlJ'j) = 0{M- 



ten 



Next, since for a fixed t & Sq, Bonferroni inequahties imply that 



lt,i^ = 1, and r]t' = for t' ^t,t' e flj+ill'j) < E{r]uJJ^j] 
then 



(26) 



using (R3) for bounding the terms with \\i(t) — Ai(t')| < _RlnM (note 
that rjt < 6), and (|24|) for |Ai(t) - Ai(t')| > RlnM. 
Due to (22) we get that 



(27) 



Y Eir^,,JJ^,) = 2c^\K\\5\l + oil)). 



Now, (21) follows for C G E. from (25), (26) and (27). Finally, 



Pj^i = P(ni • • • Hj are compatible and Hj+i is compatible) 

= P(ni ■ ■ -fi-j are compatible and 

Ilj+i is compatible and C e Ej) + O (M"^°) 
= p,-2ci|ir,J52(^^^^^))_ 



This completes the proof of (18). The induction estimate (19) at a type 



A strip follows lickewise. Part (b) of Lemma 14 is thus completed. □ 
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6.5. Proof of Theorem |lO| (c). Note that for a Poisson process 
on [0, -] X n, given rj > and any s G N, we can find 6 such that 



P({t(^'^}^'=i is 5-spht) >l-r]. 



Therefore (c) of Theorem 10 follows from Theorem 13 □ 



6.6. Proof of Theorem [To](b). In the proof of the Poisson limit for 
{'^i{gtC),t/M} an important role was played by the foliation of M 
into the leaves of Wi. More precisely, in the typical situation where 
t^^\ . . . ,t''^^ E Z are sufficiently separated under Ai (our genericity 
condition), the independence between the is due to the fact 

that is determined on a scale e~'^^^*^'^^ of Wi leafs, a scale 

on which the successive \E'i(5fjO)-), j > i yield equidistribution due to 
uniform stretch. 



The key to the proof of part (b) of Theorem 10 is that \E'i and 
\l/2 are determined at different scales along the Wi leafs. More pre- 
cisely '^2(9^') is determined on a scale l/(A^e*i ), which means that 
\E'2(5't(i)'C) is uniformly distributed on [0,2] if £ moves along a Wi leaf 

of size much larger than l/(A^e*i ). As a consequence, and since the 

scales {e-^i(*'"), . . . , e-^^^**^'), l/(Are*i''), . . . , l/(Are*i'')} (that have to 
be rearranged in an increasing order) are typically sufficiently split 
(see the generectiy condition below), the independence of the quan- 
tites ^i(5ft(i)£), . . . , 'ifi{gtis)C), ^2{gtwC), ^!2{gt(.-)C), is ensured. 

Thus, the proof of Theorem [lO|^b) proceeds along the lines of the 
proof of Theorem [13] with the modifications described below. 
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First, in addition to choosing {Sq}q<k and {iq}q<k as in Lemma 
we also divide into segments Ti . . . T^a and choose a sequence {I^ 
with values in {1 . . . m2}. 

A strip Yij is said to be of type C if it contains the points (0, ti + M) 
for t = (ti,t2) G Sq for some q G {1, . . . , k}. Indeed, by readjusting 
slightly the strips we can assume that for any choice of q, all the points 
{0,ti + M) for t = (ti,t2) £ Sq fall in a same strip. We call the latter 
points resonant with the square Sq. 



By (17) we can focus on generic configuration of squares, which in 



this setting amounts to having all the strips of type A or C are disjoint. 



Part (b) of Lemma 14 has to be modified as follows: 
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If Si, S2 ■ ■ ■ Sk is generic then 

F Ut = fort En\[jSg and St^") G Sg : 

Cti'.),i, = 1, ^2 (^gti.^C, -^^^^^ e Ti^ while ^, = forte Sg- 1^^^^ 



(l + oj^o(l)). 



As in the proof of Lemma 14 (b), the latter probabihty is estabhshed 
inductively. The definition of compatibility for type A and B strips 
remains essentially the same as in the proof of Lemma 14 (Namely, 
the only distinction is the following. Note that in general n need 
not contain any point that is resonant with Sg. For this reason it is 
convenient in this section not to require Uj to be subsets of LI but 
rather consider the strips on the plane 

= {t G : (j - 1)6 M < Ai(t) < jSM}. 

Indeed, some or all of the type C strips may be disjoint from LI. 
Now, in the definition of compatibility we require that = for all 



t G Llj n n,t 7^ tg and (19) takes form 



(28) 



Pj+i = Pj 



4ci 

1 - ^Area(nj+i n n)(i + 0(1)) 



where Hj+i = Hj+i/M. In particular if Ilj is disjoint from LI then it 
is of either type B or of type C. In case it is of type B the restrictions 



become trivial and so we have pj+i = pj in accordance with (28). 



Let LIj+i be a strip of type C and q be such that LIj+i contains the 
points (0,ti + M) for t = (^1,^2) ^ Sg. Note that j + 1 > z for the i such 
that Ilj contains Sg (because in LI, ti,t2 > and ti +t2 < M(l — 6)). 
Hence, in the definition of compatibility of LIj+i, we assume given the 
value of t^'^^ e Sg cHi such that $,t(i)^i^ = 1- 

We then say that Hj+i is compatible if 



(29) 



= for t G LIj+i and ^2 



N 



AQ),Aq) ' 
g"-! "'"'•2 



gt(g)C G Ti 



Recall the definition pj = are compatible for / < j). Then the 



proof of Theorem [lO[b) is the same as the proof of Theorem 13 if ( 18 ) 
and (28) that determine the inductive relation on the probabilities pj 
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are supplemented by the following equation when Ilj+i is of type C 

p,+i=p,|T,J(l + o(l)). 

We just need to show that away from a set of negligibly small measure 
we have 



.(9) 



(30) P(nj+i is compatible | J^) = |T;J(1 + o(l)). 

Proof of Q. Since {0,t'f '> + M) is in n^+i, we have that iVe*i" > 
gV^gO^ where C,j is the maximal value of Ai on lij. We claim that this 
implies that apart from a small exceptional set we have 

(31) p(vi;,(^_^,^^,,,£^ er.Jj;.) =1^^^ 

We can restrict to elements of J^j on which {g^^q) ■) is constant. Indeed 
if G* denotes the union of elements of J-'j such that ^{gi{q)-) takes 
the value zero on this element, then the complement of G* has small 

while by definition of 0, {gtM') is constant 



measure due to Lemma 
on the elements from G 



15 

■7* 



1' 

Now, let C and £ belong to a same element of J^j fl G*, such that 

C = hlC with |r| < (e^^Mi°°)-\ and such that ^i^{gt(q)C) = 1. Let 
(x, y, z) be the vector of gt{q)C such that (x, y, xyz) E I x J x Kg. Then 
the vector of gt{q)C that is in / x J x i^g is given by {x,y,z) where 
z = z + e^^^'^^''^^Tx (this is because C,j is the maximal value of Ai on Ilj, 
hence re^i(*'''') = 0{M-^^^)). Since 

( J = -^^z + Nxe''^\ mod (2), 

and r varies on the interval of length (e''^M^''°)~^ the equidistribution 

(|31) follows from the fact that A^e^i"' > e'^e^K 

On the other hand, the first condition in ( [29] ) is not very restrictive 
since it is violated with probability o(l) due to (19). Thus (30) is 



established finishing the proof of Theorem [lO[b) . □ 

7. Small boxes. 

One can also consider the visits to small boxes = Hj jh\ ■ 

The case 7 = is treated in Theorem [l] while the case 7 = 1/c? was 
studied in [13]. For 7 > l/rf most orbits do not visit Cn so we consider 
the remaining case < 7 < \. 



Theorem 16. Under the assumptions of Theorem 
verges to the standard Cauchy distribution. 



Dc^{x,a,N) 
p{{l-d-y)lnN)'i 
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The proof of Theorem 16 is the same as the proof of Theorem [T] 



except that now we can neglect the contribution of ks where \kj\ < N'^ 



for some j (cf. Section 3.3). Accordingly in Thereom 10 11 has to 
be replaced by 11^ = {t : tj > 7M, tj < M} which decreases the 
intensity of the limiting Poisson process by a factor (1 — d'j) 



d 



8. Continuous time. 

In this section we discuss briefly the behavior of the discrepancy 
function in the case of linear flows on the torus. Given a set C we the 
continuous time discrepancy function as 

Bciv,x,T)= [ xc{Slx)dt-TVo\{C) 
Jo 

where Si = x + vt. 

In the case of balls, it was shown in [5] that for d > 4, the continuous 
time discrepancy function has a similar behavior as the discrete time 
discrepancy, namely it converges in distribution after normalization by 
a factor T('^-3)/2(rf-i). 

Curiously, for balls in dimension d = 3, the continuous time discrep- 
ancy behaves similarly to the discrete discrepancy of cubes and gives 
rise to a Cauchy distribution after normalization by In T. This will be 
proved in Section [8^ below. 

It was also shown in [5] that for balls in dimension d = 2 the contin- 
uous time discrepancy converges, without any normalization, in distri- 
bution. In the next Section 18.11 we will show that this is also the case 
in any dimension d > 2 for the continuous time discrepancy for boxes. 

8.1. Boxes. Let C = A{Y[j{0,Uj)). We assume that the triple {A,x,v) 
is distributed according to a smooth density of compact support and 
that A G SL(d, M) is such that ||A — J|| < t] where 77 is sufficiently 
small. 

Theorem 17. As T ^ 00, Dc(f,x,T) converges in distribution. 
Proof. We have 

Dc {v, x, T) = 4^ ^ n % (^^•) ''""wfc'jr^ cos(27r(fc, x) + 

k j 

where ^m{u) = sm{27cmu)/m and kj is given by (g. We claim that 
for almost all A,v there exist a constant C{A,v) such that 

\\Bciv,x,T)\\L2<CiA,v) 
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and moreover for each e there exists N = N{A, v) such that 



ETT / .sin(7r(k,vT)) ,^ . 
n ("^^ Jk y) ^) + '^^.^.'^^ 

|fe|>Ar j ^ ' ^ 



< 6. 



LI 



To this end it suffices to demonstrate that for almost every [A, v) 

-2 



oo. 



Since det(A) ^ there exists 5{A) such that for each k there is I e 
{1 . . .d} such that \ki\ > 5\k\. Accordingly it suffices to check that for 
each / '^k^k{A,v) < oo where 

-2 



llk,]{k,v)\k\ 



All sums have the same form so we consider the case I — d. Given 
numbers Si, . . . s^-i, and £ > denote ft{k, Si . . .sa) — 

{{A,v) : \kj\ e [\k\'^,\k\'i+'] for j = and |(^;,A;)| e [|A;|"MA;|"''+ 



Then 



F{n{k,si...Sd)) < C\k 



\s+d6—d 



where s — Yl'j=i ^j- We draw two conclusions from this estimate. First, 
for almost all {A, v) we have 

Y[kj {k,v) > iki-^"^ 
\j=i J 

provided that |A;| is large enough. 
Second, for s > —2de we have 

mniKs,...sa){Av)Tk{A,v)) < C\k\''-^^''+'^+^l 

Hence 

E (^laik,s^...saM,v)rk{A,v)j <oo. 
Summing over all d-tuples (si . . . Sd) € {eZy such that 

d 

Sj < 1, s — E^ Sj > —2de 
we get E (^f, ^k{A., v)) < oo proving our claim. 
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The claim implies that for large N the distribution of T)c{v,x,T) is 
close to the distribution of 

^ / .ri sr^ TT I / , sinfvrf/c, t>T) , ^ 

Dc,^(t;,x,T)=4'^ J2 n'^%(^^) l^d l^^ ^ os(27r(fc,x) + 0fc,T,.). 

Hence it remains to prove that T)^ j^{v,x,T) converges in distribution 
as T — > oo. This convergence follows easily from the fact that as T — i- oo 
{vT} becomes uniformly distributed on (M/2Z)'^. □ 

A similar argument shows that randomness in C is not necessary, 
namely we have the following result. 

Theorem 18. Let C = Ylj{0,Uj). Suppose that the pair {x,v) has a 
smooth distribution of compact support. Then Dc(f,a;,T) converges in 
distribution as T —t- oo. 



The proof of Theorem 18 is similar to the proof of Theorem 17 with 
the additional simplifications since now \kj\ > 1 and so only {k, v) may 
possibly be small. Therefore we leave the proof to the reader. 

8.2. Balls. In this section, C is assumed to be a ball of radius r in T^. 
We suppose that v is chosen according to a smooth density p whose 
support is compact and does not contain the origin and r is uniformly 
distributed on some segment [a,b]. Let a denote the product of the 
distribution of v, the distribution of r and the Haar measure on T^. 

Theorem 19. There exists a constant 5 such that ^^t"''')'^^'^'-^^ ^on- 

1^ pr In T 

verges to the standard Cauchy distribution. 

Proof. The proof is similar to the proof of Theorem [T] so we just outline 
the main steps. We have 

^B{o,r){v,x,T) = ^ fk{r,v,x,T) = ^ 

kalfl kalP ,k prime 

whprp f, — r, cos[27r(fc,x)+7r(A:,Tii)]sin(7r(fc,Tt))) _ v^oo r ^ 

wnere jk — Ck n{k,v) ' ~ 2^p=i j>^p 

r 

TT /C 

Similarly to Section [3] we show that the main contribution to the dis- 
crepancy comes from the harmonics where < < and 
\k\ < T. Therefore the key step in proving Theorem [19] is the following. 

Proposition 20. The set 

{\k\'{k,v) \nT,{k,Tv) mod2,{(fc,a;)},{r|A;|}}|^|^^_^^,|(^^^^lj^^^^_^ ^^^^ 
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converges as T ^ oo to a Poisson process on [—-,-] x (IR/2Z) x (M/Z) 
with constant intensity. 



The proof of Proposition 20 is similar to the proof of Theorem [7| and 



consists of the following steps. 

(a) We prove the Poisson limit for f ) InT} using the ar- 



gument of Section |6.2[ We first normalize one of the coordinates, 
say f3, of the vector t> to 1, which reduces the study of the Pois- 
son limit for f ) InT} to the study of the visits to the cusp 
in M = ^L3(M)/5L3(Z) of gt^ with 

3* 

= I e* I and K{vi,V2) 







-2t 




More precisely, the relevant neighborhood in the cusp is defined via the 
function 

$(£) = + y{uf)^K{{x{uf + y{uf)z{u)) 

udC prime 

with / = [1, e), K = [—^1^, ^T^]- We then define 

u={x,y,z)(iC prime 

and the Poisson limit of {|A;p(A;, f ) In T} is obtained from a Poisson 
limit for 

{^(fl'tA)}te[0,lnT],${<;tA)=l- 

In this setting, the manifold determined by A(t'i, f 2) consists of the full 
strong unstable foliation of gt and there is no need for extra parameters 
to establish the Poisson limit. 

(b) We prove that {k, Tv) mod 2 is asymptotically independent of 
f ) InT using the fact that their values are determined at different 



scales (cf. Section 6.6). 



(c) We show that (A;, a;) and {r|/c|} are independent of the previous 
data using the superlacunarity of the sequence of small denominators 
(cf. Theorem [9]). □ 
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